SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy

Authors

Ali Reza Khanteymoori, Department of Computer Engineering, University of Zanjan, Zanjan, Iran

Jamshid Pirgazi, Department of Computer Engineering, University of Zanjan, Zanjan, Iran

Abstract:

In this paper, we propose a new gene selection algorithm, called SFLA-FS, based on the Shuffled Frog Leaping Algorithm (SFLA). The proposed algorithm is intended to improve cancer classification accuracy. Most biological datasets, such as cancer datasets, have a large number of genes and few samples. However, most of these genes are not useful for tasks such as cancer classification. Therefore, selecting the appropriate genes is important in bioinformatics and machine learning. The proposed method combines the advantages of wrapper and filter methods for gene subset selection. SFLA-FS consists of two phases. In the first phase, a filter method is used to rank genes from high-dimensional microarray data; in the second phase, SFLA is applied to gene selection. The performance of SFLA-FS was evaluated for cancer classification using seven standard microarray cancer datasets. The experimental results are compared with those obtained from several existing well-known gene selection algorithms. The results show that SFLA-FS can produce small gene subsets while maintaining high accuracy in cancer classification.
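
The two-phase idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not name the filter criterion, the classifier, or the SFLA parameters, so the signal-to-noise ranking score, the nearest-centroid classifier with leave-one-out accuracy, the size penalty, the bit-copy leap rule, and all function names and default values below are assumptions chosen only for illustration. Binary class labels 0 and 1 are also assumed.

```python
import random
from statistics import mean, pstdev

def rank_genes(X, y, top_k):
    """Filter phase: keep the top_k genes by a signal-to-noise score (assumed criterion)."""
    scores = []
    for g in range(len(X[0])):
        a = [row[g] for row, label in zip(X, y) if label == 0]
        b = [row[g] for row, label in zip(X, y) if label == 1]
        scores.append(abs(mean(a) - mean(b)) / (pstdev(a) + pstdev(b) + 1e-9))
    order = sorted(range(len(scores)), key=lambda g: scores[g], reverse=True)
    return order[:top_k]

def loo_accuracy(X, y, genes):
    """Leave-one-out accuracy of a nearest-centroid classifier (assumed classifier)."""
    if not genes:
        return 0.0
    correct = 0
    for i in range(len(X)):
        best_label, best_dist = None, float("inf")
        for label in (0, 1):
            rows = [X[j] for j in range(len(X)) if j != i and y[j] == label]
            if not rows:
                continue
            # Squared distance from sample i to the class centroid on the chosen genes.
            dist = sum((X[i][g] - sum(r[g] for r in rows) / len(rows)) ** 2 for g in genes)
            if dist < best_dist:
                best_label, best_dist = label, dist
        correct += best_label == y[i]
    return correct / len(X)

def fitness(mask, candidates, X, y, penalty=0.01):
    """Wrapper fitness: accuracy minus a small cost per selected gene (assumed form)."""
    genes = [g for g, bit in zip(candidates, mask) if bit]
    return loo_accuracy(X, y, genes) - penalty * len(genes)

def leap(worst, target, rng):
    """Discrete leap: each bit is copied from the better frog with probability 0.5."""
    return [t if rng.random() < 0.5 else w for w, t in zip(worst, target)]

def sfla_select(X, y, top_k=10, frogs=12, memeplexes=3, rounds=10, local_steps=3, seed=0):
    """Wrapper phase: a simplified SFLA over bit masks on the filtered genes."""
    rng = random.Random(seed)
    candidates = rank_genes(X, y, top_k)
    score = lambda m: fitness(m, candidates, X, y)
    pop = [[rng.randint(0, 1) for _ in candidates] for _ in range(frogs)]
    for _ in range(rounds):
        # Shuffle step: sort all frogs and deal them round-robin into memeplexes.
        pop.sort(key=score, reverse=True)
        global_best = pop[0]
        groups = [pop[i::memeplexes] for i in range(memeplexes)]
        for group in groups:
            for _ in range(local_steps):
                group.sort(key=score, reverse=True)
                worst = group[-1]
                # Try the memeplex best first, then the global best.
                for target in (group[0], global_best):
                    trial = leap(worst, target, rng)
                    if score(trial) > score(worst):
                        break
                else:
                    # Neither leap improved the frog: replace it with a random one.
                    trial = [rng.randint(0, 1) for _ in candidates]
                group[-1] = trial
        pop = [frog for group in groups for frog in group]
    best = max(pop, key=score)
    return [g for g, bit in zip(candidates, best) if bit]
```

Calling `sfla_select(X, y)` on a list of expression rows `X` and labels `y` returns the indices of the selected genes. The filter step keeps the wrapper search small, which matters because each fitness evaluation retrains and tests the classifier.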

Similar resources

Improving Cancer Classification Accuracy Using Gene Pairs

Recent studies suggest that the deregulation of pathways, rather than individual genes, may be critical in triggering carcinogenesis. The pathway deregulation is often caused by the simultaneous deregulation of more than one gene in the pathway. This suggests that robust gene pair combinations may exploit the underlying bio-molecular reactions that are relevant to the pathway deregulation and t...

A Novel Scheme for Improving Accuracy of KNN Classification Algorithm Based on the New Weighting Technique and Stepwise Feature Selection

The K nearest neighbor (KNN) algorithm is one of the most frequently used techniques in data mining for its integrity and performance. Though the KNN algorithm is highly effective in many cases, it has some essential deficiencies, which affect the classification accuracy of the algorithm. First, the effectiveness of the algorithm is affected by redundant and irrelevant features. Furthermore, this algori...

Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...

Improving Classification Accuracy Using Gene Ontology Information

Classification problems, e.g., the gene function prediction problem, are very important in bioinformatics. Previous work mainly focuses on improving the classification techniques used. With the emergence of Gene Ontology (GO), extra knowledge about the gene products can be extracted from GO. This kind of knowledge reveals the relationship of the gene products and is helpful for solving the cla...

Journal title

International Journal of Modeling, Identification, Simulation and Control

volume 47 issue 1

pages 1-8

publication date 2015-05-22

Keywords

Bioinformatics, Cancer Classification, Gene Selection, SFLA, Microarray Data
